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(54) Speech coding apparatus and speech decoding apparatus 



(57) A speech coding apparatus includes a spec- 
trum parameter calculation section (200,210), an adap- 
tive codebook section (500), a sound source 
quantization section (350;355;356;357,366), a discrimi- 
nation section (370), and a multiplexer section (400). 
The spectrum parameter calculation section (200,210) 
receives a speech signal and quantizes a spectrum 
parameter. The adaptive codebook section (500) 
obtains a delay and a gain from a past quantized sound 
source signal using an adaptive codebook, and obtains 
a residue by predicting a speech signal. The sound 
source quantization section (350 ;355; 356*357,366) 
quantizes a sound source signal using the spectrum 
parameter. The discrimination section (370) discrimi- 
nates the mode. The sound source quantization section 



(350;355;356;357,366) has a codebook (351,352) for 
representing a sound source signal by a combination of 
non-zero pulses and collectively quantizing amplitudes 
or polarities of the pulses in a predetermined mode, and 
searches combinations of code vectors and shift 
amounts used to shift the positions of the pulses to out- 
put a combination of a code vector and shift amount 
which minimizes distortion relative to input speech. The 
multiplexer section (400) outputs a combination of 
outputs from the spectrum parameter calculation sec- 
tion (200,210). the adaptive codebook section (500), 
and the sound source quantization section 
(350;355;356;357.366). 
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Description 

BACKGROUND OF THE INVENTION 

5 FIELD OF THE INVENTION: 

[0001] The present invention relates to a speech coding apparatus and speech decoding apparatus and, more par- 
ticularly, to a speech coding apparatus for coding a speech signal at a tow bit rate with high quality. 

w DESCRIPTION OF THE PRIOR ART: 

[0002] As a conventional method of coding a speech signal with high efficiency, CELP (Code Excited Linear Predictive 
Coding) is Known, which is disclosed, for example, in M. Schroeder and B. Atal, "Code-excited linear prediction: High 
quality speech at low bit rates", Proc. ICASSP, 1 985, pp. 937-940 (reference 1) and Kleijn et al., "Improved speech qual- 

15 ity and efficient vector quantization in SELP", Proc. ICASSP, 1988, pp. 155-158 (reference 2). 

[0003] In this CELP coding scheme, on the transmission side, spectrum parameters representing a spectrum char- 
acteristic of a speech signal are extracted from the speech signal for each frame (for example, 20 ms) using linear pre- 
dictive coding (LPC) analysis. Each frame is divided into subframes (for example, of 5 ms), and for each subframe, 
parameters for an adaptive codebook (a delay parameter and a gain parameter corresponding to the pitch period) are 

20 extracted based on the sound source signal in the past and then the speech signal of the subframe is pitch predicted 
using the adaptive codebook. 

[0004] With respect to the sound source signal obtained by the pitch prediction, an optimum sound source code vector 
is selected from a sound source codebook (vector quantization codebook) consisting of predetermined types of noise 
signals, and an optimum gain is calculated to quantize the sound source signal. 

25 [0005] The selection of a sound source code vector is performed so as to minimize the error power between a signal 
synthesized based on the selected noise signal and the residue signal. Then, an index and a gain representing the kind 
of the selected code vector as well as the spectrum parameter and the parameters of the adaptive codebook are com- 
bined and transmitted by a multiplexer section. A description of the operation of the reception side will be omitted. 
[0006] The conventional coding scheme described above is disadvantageous in that a large calculation amount is 

30 required to select an optimum sound source code vector from a sound source codebook. 

[0007] This arises from the fact that, in the methods in references 1 and 2, in order to select a sound source code 
vector, filtering or convolution calculation is performed once for each code vectors, and such calculation is repeated by 
a number of times equal to the number of code vectors stored in the codebook. 

[0008] Assume that the number of bits of the codebook is B and the order is N. In this case, if the filter or impulse 
35 response length in filtering or convolution calculation is K, the calculation amount required is N x K x 2B x 8000 per 
second. As an example, if B=10, N=40 and k=10 t 81 ,920.000 calculations are required per second. In this manner, the 
conventional coding scheme is disadvantageous in that it requires a very large calculation size. 
[0009] Various methods which reduce the calculation amount required to search a sound source codebook have been 
proposed. One of the methods is an ACELP (Algebraic Code Excited Linear Prediction) method, which is disclosed, for 
40 example, in C. Laflamme et al., "16 kbps wideband speech coding technique based on algebraic CELP", Proc. ICASSP, 
1991, pp. 13-1 6 (reference 3). 

[001 0] According to the method disclosed in reference 3, a sound source signal is represented by a plurality of pulses 
and transmitted while the positions of the respective pulses are represented by predetermined numbers of bits. In this 
case, since the amplitude of each pulse is limited to +1 .0 or -1 .0, the calculation amount required to search pulses can 
45 be greatly reduced. 

[001 1] As described above, according to the method disclosed in reference 3, a great reduction in calculation amount 
can be attained. 

[0012] Another problem is that at a bit rate less than 8 kb/s, especially when background noise is superimposed on 
speech, the background noise portion of the coded speech greatly deteriorates in sound quality, although the sound 

so quality is good at 8 kb/s or higher. 

[0013] Such a problem arises for the following reason. Since a sound source is represented by a combination of a 
plurality of pulses, pulses concentrate near a pitch pulse as the start point of a pitch in a vowel interval of speech. This 
signal can therefore be efficiently expressed by a small number of pulses. For a random signal like background noise, 
however, pulses must be randomly generated, and hence the background noise cannot be properly expressed by a 

55 small number of pulses. As a consequence, if the bit rate decreases, and the number of pulses decreases, the sound 
quality of background noise abruptly deteriorates. 
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SUMMARY OF THE INVENTION 

[0014] The present invention has been made in consideration of the above situation in the prior art, and has as its 
object to provide a speech coding system which can solve the above problems and suppress a deterioration in sound 

5 quality in terms of background noise, in particular, with a relatively small calculation amount 

[0015] In order to achieve the above object, a speech coding apparatus according to the first aspect of the present 
invention including a spectrum parameter calculation section for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter, an adaptive codebook section tor obtaining a delay and a gain from 
a past quantized sound source signal by using an adaptive codebook, and obtaining a residue by predicting a speech 

io signal, and a sound source quantization section for quantizing a sound source signal of the speech signal by using the 
spectrum parameter and outputting the sound source signal is characterized by comprising a discrimination section for 
discriminating a mode on the basis of a past quantized gain of an adaptive codebook, a sound source quantization sec- 
tion which has a codebook for representing a sound source signal by a combination of a plurality of non-zero pulses 
and collectively quantizing amplitudes or polarities of the pulses when an output from the discrimination section indi- 

is cates a predetermined mode, and searches combinations of code vectors stored in the codebook and a plurality of shift 
amounts used to shift positions of the pulses so as to output a combination of a code vector and shift amount which 
minimizes distortion relative to input speech, and a multiplexer section for outputting a combination of an output from 
the spectrum parameter calculation section, an output from the adaptive codebook section, and an output from the 
sound source quantization section. 

20 [0016] A speech coding apparatus according to the second aspect of the present invention including a spectrum 
parameter calculation section for receiving a speech signal, obtaining a spectrum parameter, and quantizing the spec- 
trum parameter, an adaptive codebook section for obtaining a delay and a gain from a past quantized sound source sig- 
nal by using an adaptive codebook, and obtaining a residue by predicting a speech signal, and a sound source 
quantization section for quantizing a sound source signal of the speech signal by using the spectrum parameter and 

25 outputting the sound source signal, is characterized by comprising a discrimination section for discriminating a mode 
on the basis of a past quantized gain of an adaptive codebook. a sound source quantization section which has a code- 
book for representing a sound source signal by a combination of a plurality of non-zero pulses and collectively quantiz- 
ing amplitudes or polarities of the pulses when an output from the discrimination section indicates a predetermined 
mode, and outputs a code vector that minimizes distortion relative to input speech by generating positions of the pulses 

30 according to a predetermined rule, and a multiplexer section for outputting a combination of an output from the spec- 
trum parameter calculation section, an output from the adaptive codebook section, and an output from the sound 
source quantization section. 

[001 7] A speech coding apparatus according to the third aspect of the present invention including a spectrum param- 
eter calculation section for receiving a speech signal, obtaining a spectrum parameter, and quantizing the spectrum 

35 parameter, an adaptive codebook section for obtaining a delay and a gain from a past quantized sound source signal 
by using an adaptive codebook, and obtaining a residue by predicting a speech signal, and a sound source quantization 
section for quantizing a sound source signal of the speech signal by using the spectrum parameter and outputting the 
sound source signal is characterized by comprising a discrimination section for discriminating a mode on the basis of a 
past quantized gain of an adaptive codebook, a sound source quantization section which has a codebook for represent- 

40 ing a sound source signal by a combination of a plurality of non-zero pulses and collectively quantizing amplitudes or 
polarities of the pulses when an output from the discrimination section indicates a predetermined mode, and a gain 
codebook for quantizing gains, and searches combinations of code vectors stored in the codebook, a plurality of shrft 
amounts used to shift positions of the pulses, and gain code vectors stored in the gain codebook so as to output a com- 
bination of a code vector, shift amount, and gain code vector which minimizes distortion relative to input speech, and a 

45 multiplexer section for outputting a combination of an output from the spectrum parameter calculation section, an output 
from the adaptive codebook section, and an output from the sound source quantization section. 
[0018] A speech coding apparatus according to the fourth aspect of the present invention including a spectrum 
parameter calculation section for receiving a speech signal, obtaining a spectrum parameter, and quantizing the spec- 
trum parameter, an adaptive codebook section for obtaining a delay and a gain from a past quantized sound source sig- 

50 nal by using an adaptive codebook, and obtaining a residue by predicting a speech signal, and a sound source 
quantization section for quantizing a sound source signal of the speech signal by using the spectrum parameter and 
outputting the sound source signal is characterized by comprising a discrimination section for discriminating a mode on 
the basis of a past quantized gain of an adaptive codebook, a sound source quantization section which has a codebook 
for representing a sound source signal by a combination of a plurality of non-zero pulses and collectively quantizing 

55 amplitudes or polarities of the pulses when an output from the discrimination section indicates a predetermined mode, 
and a gain codebook for quantizing gains, and outputs a combination of a code vector and gain code vector which min- 
imizes distortion relative to input speech by generating positions of the pulses according to a predetermined rule, and 
a multiplexer section for outputting a combination of an output from the spectrum parameter calculation section, an out- 
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put from the adaptive codebook section, and an output from the sound source quantization section. 
[0019] A speech decoding apparatus according to the fifth aspect of the present invention is characterized by com- 
prising a demultiplexer section for receiving and demultiplexing a spectrum parameter, a delay of an adaptive codebook, 
a quantized gain, and quantized sound source information, a mode discrimination section for discriminating a mode by 
5 using a past quantized gain in the adaptive codebook, and a sound source signal reconstructing section for reconstruct- 
ing a sound source signal by generating non-zero pulses from the quantized sound source information when an output 
from the discrimination section indicates a predetermined mode, wherein a speech signal is reproduced by passing the 
sound source signal through a synthesis filter section constituted by spectrum parameters. 

[0020] As is obvious from the above aspects, according to the present invention, the mode is discriminated on the 
10 basis of the past quantized gain of the adaptive codebook. If a predetermined mode is discriminated, combinations of 
code vectors stored in the codebook, which is used to collectively quantize the amplitudes or polarities of a plurality of 
pulses, and a plurality of shift amounts used to temporally shift predetermined pulse positions are searched to select a 
combination of a code vector and shift amount which minimizes distortion relative to input speech. With this arrange- 
ment, even if the bit rate is low, a background noise portion can be properly coded with a relatively small amount calcu- 
75 lation amount. 

[0021] In addition, according to the present invention, a combination of a code vector, shift amount, and gain code 
vector which minimizes distortion relative to input speech is selected by searching combinations of code vectors, a plu- 
rality of shift amounts, and gain code vectors stored in the gain codebook for quantizing gains. With this operation, even 
if speech on which background noise is superimposed is coded at a low bit rate, a background noise portion can be 
20 properly coded. 

[0022] The above and many other objects, features and advantages of the present invention will become manifest to 
those skilled in the art upon making reference to the following detailed description and accompanying drawings in which 
preferred embodiments incorporating the principles of the present invention are shown by way of illustrative examples. 



25 BRIEF DESCRIPTION OF THE DRAWINGS 



[0023] 



Fig. 1 is a block diagram showing the schematic arrangement of the first embodiment of the present invention; 
30 Fig. 2 is a block diagram showing the schematic arrangement of the second embodiment of the present invention; 
Fig. 3 is a block diagram showing the schematic arrangement of the third embodiment of the present invention; 
Fig. 4 is a block diagram showing the schematic arrangement of the fourth embodiment of the present invention; 
and 

Fig. 5 is a block diagram showing the schematic arrangement of the fifth embodiment of the present invention. 

35 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 



[0024] Several embodiments of the present invention will be described below with reference to the accompanying 
drawings. In a speech coding apparatus according to an embodiment of the present invention, a mode discrimination 

40 circuit (370 in Fig. 1 ) discriminates the mode on the basis of the past quantized gain of an adaptive codebook. When a 
predetermined mode is discriminated, a sound source quantization circuit (350 in Fig. 1) searches combinations of 
code vectors stored in a codebook (351 or 352 in Fig. 1), which is used to collectively quantize the amplitudes or polar- 
ities of a plurality of pulses, and a plurality of shift amounts used to temporally shift predetermined pulse positions, to 
select a combination of a code vector and shift amount which minimizes distortion relative to input speech. A gain quan- 

45 tization circuit (365 in Fig. 1) quantizes gains by using a gain codebook (380 in Fig. 1). 

[0025] According to a preferred embodiment of the present invention, a speech decoding apparatus includes a demul- 
tiplexer section (510 in Fig. 5) for receiving and demultiplexing a spectrum parameter, a delay of an adaptive codebook, 
a quantized gain, and quantized sound source information, a mode discrimination section (530 in Fig. 5) for discriminat- 
ing the mode on the basis of the past quantized gain of the adaptive codebook, and a sound source decoding section 

so (540 in Fig. 5) for reconstructing a sound source signal by generating non-zero pulses from the quantized sound source 
information. A speech signal is reproduced or resynthesized by passing the sound source signal through a synthesis 
fitter (560 in Fig. 5) defined by spectrum parameters. 

[0026] According to a preferred embodiment of the present invention, a speech coding apparatus according to the 
first aspect of the present invention includes a spectrum parameter calculation section for receiving a speech signal. 
55 obtaining a spectrum parameter, and quantizing the spectrum parameter, an adaptive codebook section for obtaining a 
delay and a gain from a past quantized sound source signal by using an adaptive codebook, and obtaining a residue 
by predicting a speech signal, and a sound source quantization section for quantizing a sound source signal of the 
speech signal by using the spectrum parameter and outputting the sound source signal is characterized by comprising 
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a discrimination section or discriminating a mode on the basis of a past quantized gain of an adaptive codebook, a 
sound source quantization section which has a codebook for representing a sound source signal by a combination of a 
plurality of non-zero pulses and collectively quantizing amplitudes or polarities of the pulses when an output from the 
discrimination section indicates a predetermined mode, and searches combinations of code vectors stored in the code- 

5 book and a plurality of shift amounts used to shift positions of the pulses so as to output a combination of a code vector 
and shift amount which minimizes distortion relative to input speech, and a multiplexer section for outputting a combi- 
nation of an output from the spectrum parameter calculation section, an output from the adaptive codebook section, an 
output from the sound source quantization section, a demultiplexer section for receiving and demultiplexing a spectrum 
parameter, a delay of an adaptive codebook, a quantized gain, and quantized sound source information, a mode dis- 

10 crimination section for discriminating a mode by using a past quantized gain in the adaptive codebook, and a sound 
source signal reconstructing section for reconstructing a sound source signal by generating non-zero pulses from the 
quantized sound source information when an output from the discrimination section indicates a predetermined mode. 
A speech signal is reproduced by passing the sound source signal through a synthesis filter section constituted by 
spectrum parameters. 

75 [0027] A speech coding apparatus according to the present invention includes a spectrum parameter calculation sec- 
tion for receiving a speech signal, obtaining a spectrum parameter, and quantizing the spectrum parameter, an adaptive 
codebook section for obtaining a delay and a gain from a past quantized sound source signal by using an adaptive 
codebook. and obtaining a residue by predicting a speech signal, and a sound source quantization section for quantiz- 
ing a sound source signal of the speech signal by using the spectrum parameter and outputting the sound source sig- 

20 nal, is characterized by comprising a discrimination section for discriminating a mode on the basis of a past quantized 
gain of an adaptive codebook, a sound source quantization section which has a codebook for representing a sound 
source signal by a combination of a plurality of non-zero pulses and collectively quantizing amplitudes or polarities of 
the pulses when an output from the discrimination section indicates a predetermined mode, and outputs a code vector 
that minimizes distortion relative to input speech by generating positions of the pulses according to a predetermined 

25 rule, and a multiplexer section for outputting a combination of an output from the spectrum parameter calculation sec- 
tion, an output from the adaptive codebook section, an output from the sound source quantization section, a demulti- 
plexer section for receiving and demultiplexing a spectrum parameter, a delay of an adaptive codebook, a quantized 
gain, and quantized sound source information, a mode discrimination section for discriminating a mode by using a past 
quantized gain in the adaptive codebook, and a sound source signal reconstructing section for reconstructing a sound 

30 source signal by generating pulse positions according to a predetermined rule and generating amplitudes or polarities 
for the pulses from a code vector to generate a sound source signal when the output from the discrimination section 
indicates a predetermined mode. A speech signal is reproduced by passing the sound source signal through a synthe- 
sis filter section constituted by spectrum parameters. 

35 Fir s t Em b odim ent : 

[0028] Fig. 1 is a block diagram showing the arrangement of a speech coding apparatus according to an embodiment 
of the present invention. 

[0029] Referring to Fig. 1, when a speech signal is input through an input terminal 100, a frame division circuit 110 
40 divides the speech signal into frames (for example, of 20 ms). A subframe division circuit 120 divides the speech signal 
of each frame into subframes (for example, of 5 ms) shorter than the frames. 

[0030] A spectrum parameter calculation circuit 200 extracts speech from the speech signal of at least one subframe 
using a window (for example, of 24 ms) longer than the subframe length and calculates spectrum parameters by com- 
putations of a predetermined order (for example, P = 10). In this case, for the calculation of spectrum parameters, an 
45 LPC analysis, a Burg analysis, and the like which are well known in the art can be used. In this case, the Burg analysts 
is used. Since the Burg analysis is disclosed in detail in Nakamizo, "Signal Analysis and System Identification", Corona, 
1988, pp. 82 - 87 (reference 4), a description thereof will be omitted. 

[0031] In addition, a spectrum parameter calculation circuit 210 transforms linear predictive coefficients a il (i=1 
10) calculated using the Burg method into LSP parameters suitable for quantization and interpolation. Such transforma- 
50 tion from linear predictive coefficients into LSP parameters is disclosed in Sugamura et al., "Speech Data Compression 
by LSP Speech Analysis-Synthesis Technique", Journal of the Electronic Communications Society of Japan, J64-A, 
1981, pp. 599-606 (references). 

[0032] For example, linear predictive coefficients calculated for the second and fourth subframes based on the Burg 
method are transformed into LSP parameters whereas LSP parameters of the first and third subframes are determined 
55 by linear interpolation, and the LSP parameters of the first and third subframes are inversely transformed into linear pre- 
dictive coefficients. Then, the linear predictive coefficients a il 0=1,.... 10, 1=1 5) of the first to fourth subframes are 

output to a perceptual weighting circuit 230. The LSP parameters of the fourth subframe are output to the spectrum 
parameter quantization circuit 210. 
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[0033] The spectrum parameter quantization circuit 210 efficiently quantizes the LSP parameters of a predetermined 
subframe from the spectrum parameters and outputs a quantization value which minimizes the distortion given by: 

5 Dj = £ W(i)[LSP(i) - QLSP(i)j] 2 (1) 

i=1 

where LSP(i), QLSP(i)j, and W(i) are the LSP parameter of the ith-order before quantization, the jth result after the 
quantization, and the weighting coefficient, respectively. 

10 [0034] In the following description, it is assumed that vector quantization is used as a quantization method, and LSP 
parameters of the fourth subframe are quantized. Any known technique can be employed as the technique for vector 
quantization of LSP parameters. More specifically, a technique disclosed in, for example, Japanese Unexamined Patent 
Publication No. 4-171500 (Japanese Patent Application No. 2-297600) (reference 6), Japanese Unexamined Patent 
Publication No. 4-363000 (Japanese Patent Application No. 3-261925) (reference 7), Japanese Unexamined Patent 

is Publication No. 5-6199 (Japanese Patent Application No. 3-155049) (reference 8), T. Nomura et al., "LSP Coding VQ- 
SVQ with Interpolation in 4.075 kbps M-LCELP Speech Coder", Proc. Mobile Multimedia Communications, 1993, pp. 
B.2.5 (reference 9) or the like can be used. Accordingly, a description of details of the technique is omitted herein. 
[0035] The spectrum parameter quantization circuit 210 reconstructs the LSP parameters of the first to fourth sub- 
frames based on the LSP parameters quantized with the fourth subframe. Here, linear interpolation of the quantization 

20 LSP parameters of the fourth subframe of the current frame and the quantization LSP parameters of the fourth sub- 
frame of the immediately preceding frame is performed to reconstruct LSP parameters of the first to third subframes. 
[0036] In this case, after a code vector which minimizes the error power between the LSP parameters before quanti- 
zation and the LSP parameters after quantization is selected, the LSP parameters of the first to fourth subframes are 
reconstructed by linear interpolation. In order to further improve the performance, after a plurality of candidates are first 

25 selected as a code vector which minimizes the error power, the accumulated distortion may be evaluated with regard to 
each of the candidates to select a set of a candidate and an interpolation LSP parameter which exhibit a minimum accu- 
mulated distortion. The details of this technique are disclosed, for example, in Japanese Unexamined Patent Publica- 
tion No. 6-222797 (reference 10). 

[0037] The LSP parameters of the first to third subframes reconstructed in such a manner as described above and 
30 the quantization LSP parameters of the fourth subframe are transformed into linear predictive coefficients a il (i=1 

10, 1=1 5) for each subframe, and the linear predictive coefficients are output to the impulse response calculation 

circuit 310. Furthermore, an index representing the code vector of the quantization LSP parameters of the fourth sub- 
frame is output to a multiplexer 400. 

[0038] The perceptual weighting circuit 230 receives the linear predictive coefficients a il (i=1 10. 1 =1 5) before 

35 quantization for each subframe from the spectrum parameter calculation circuit 200, performs perceptual weighting for 
the speech signal of the subframe on the basis of the method described in reference 1 and outputs a resultant percep- 
tual weighting signal. 

[0039] A response signal calculation circuit 240 receives the linear predictive coefficients a il for each subframe from 
the spectrum parameter calculation circuit 200, receives the linear predictive coefficients a il reconstructed by quanti- 
40 zation and interpolation for each subframe from the spectrum parameter quantization circuit 210, calculates, for one 
subframe, a response signal with which the input signal is reduced to zero d(n)=0 using a value stored in an interval 
filter memory, and outputs the response signal to a subtracter 235. In this case, the response signal x 2 (n) is represented 
by: 

45 10 10 10 

x 2 (n)=d(n)-£ aj d(n - i)£ a^'yCn - i) + £ a^x^n - i) (2) 
i=1 i=1 i=1 



so [0040] If n - i £ 0 , then 

y(n-i)=p(N+(n-i)) (3) 
* 2 (n-i) = s w (N + (n-i)) (4) 

55 

where N is the subframe length, y is the weighting coefficient for controlling the perceptual weighting amount and has a 
value equal to the value of equation (7) given below, and s^n) and p(n) are an output signal of a weighting signal cal- 
culation circuit 360 and an output signal of the term of the denominator of a filter described by the first term of the right 
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side of equation (7), respectively. 

[0041] The subtracter 235 subtracts response signals x2(n) corresponding to one subframe from the perceptual 
weighting signal x^n) by: 

5 x w (n)=x w (n) - x x (n) (5) 

and outputs a signal x' w (n) to an adaptive codebook circuit 500. 

[0042] The impulse response calculation circuit 310 calculates only a predetermined number L of impulse responses 
h^n) of a perceptual weighting filter H(z) whose z-transform (transfer function) is represented by: 

10 

10 

H W (Z) = — n W ,5 (6) 

l-^a.Y'z-'l-^OY'Z- 

i=1 1=1 



and outputs them to the adaptive codebook circuit 500 and a sound source quantization circuit 350. 
[0043] The adaptive codebook circuit 500 receives a sound source signal v(n) in the past from a gain quantization 
20 circuit 366, receives the output signal x' w (n) from the subtracter 235 and the impulse responses h^n) from the impulse 
response calculation circuit 310. Then, the adaptive codebook circuit 500 calculates a delay DT corresponding to the 
pitch, which minimizes the distortion given by: 



25 



30 



D T = £ 4 (n)-[ £ x w (n)y w (n - T)] 2 /[^ y 2 w (n - T)l (7) 

n=0 Ln=0 J U=0 J 



fory w (n-T) = v(n - T)*h w (n) (8) 

and outputs an index representing the delay to the multiplexer 400. : 
where the symbol * signifies a convolution calculation. '4 
[0044] A gain p is obtained by: 



35 N-1 N-1 

2 



p = £ *w (n)y w (n - T)/£ (n - T) (9) 

n=0 n=0 



40 [0045] In this case, in order to improve the extraction accuracy of a delay for the voice of a woman or a child, the delay 
may be calculated not as an integer sample value but a decimal fraction sample value. A detailed method is disclosed, 
for example, in P. Kroon et. al., "Pitch predictors with high terminal resolution", Proc. ICASSP, 1990, pp.661 -664 (refer- 
ence 11). 

[0046] In addition, the adaptive codebook circuit 500 performs pitch prediction:. 

45 

e w (n) = ^(n) - pv(n - T) * h w (n) (10) 



and outputs a resultant predictive residue signal e^n) to the sound source quantization circuit 350. 

so [0047] A mode discrimination circuit 370 receives the adaptive codebook gain § quantized by the gain quantization 
circuit 366 one subframe ahead of the current subframe, and compares it with a predetermined threshold Th to perform 
voiced/unvoiced determination. More specifically, if p is larger than the threshold Th, a voiced sound is determined. If p 
is smaller than the threshold Th, an unvoiced sound is determined. The mode discrimination circuit 370 then outputs a 
voiced/unvoiced discrimination information to the sound source quantization circuit 350. the gain quantization circuit 

55 366, and the weighting signal calculation circuit 360. 

[0048] The sound source quantization circuit 350 receives the voiced/unvoiced discrimination information and 
switches pulses depending on whether a voiced or an unvoiced sound is determined. 
[0049] Assume that M pulses are generated for a voiced sound. 
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[0050] For a voiced sound, a B-bit amplitude codebook or polarity codebook is used to collectively quantize the ampli- 
tudes of pulses in units of M pulses. A case wherein the polarity codebook is used will be described below. This polarity 
codebook is stored in a codebook 351 for a voiced sound, and is store din a codebook 352 for an unvoiced sound. 
[0051] For a voiced sound, the sound source quantization circuit 350 reads out polarity code vectors from the code- 
book 351 , assigns positions to the respective code vectors, and selects a combination of a code vector and a position 
which minimizes the distortion given by: 



10 



N-1 

£ 

n=0 



e w( n )-£9 ik h w( n - m i) 
i=1 



(11) 



where h w (n) is the perceptual weighting impulse response. 

[0052] Equation (11) can be minimized by obtaining a combination of an amplitude code vector k and a position mi 
75 which maximizes ,j given by: 



' (Kj) 



i-N-1 -| N-1 

£ ejnjs^nrij) 2 /£ 4(m,) 

Ln=0 -I n=0 



(12) 



where s^mi) is calculated according to equation (5) above. 
[0053] Alternatively, a combination which maximizes D (k j^: 



25 



r-N-1 -t N-1 

D (KJ)= £ <Mn)v k (n) 2 /£ 4 k ( mj ) 

Lrh=0 J n=0 



(13) 



30 



N-1 



for <(>(n) = £ e w(')h w (i - n), n = 0.....N - 1 



(14) 



may be selected. The calculation amount required for the numerator is smaller in this operation than in the above oper- 
35 ation. 

[0054] In this case, to reduce the calculation amount, the positions that the respective pulses can assume for a voiced 
sound can be limited as in reference 3. If, for example, N = 40 and M = 5, the possible positions of the respective pulses 
are given by Table 1 . 



40 



45 









Table 1 






0, 


5, 


10, 


15, 20, 25, 


30, 


35 


1. 


6. 


11. 


16,21,26. 


31, 


36 


2, 


6, 


12, 


17, 22, 27, 


32, 


37 


3, 


8, 


13. 


18,23,28. 


33, 


38 


4, 


9, 


14, 


19, 24, 29. 


34, 


39 



50 



55 



[0055] An index representing a code vector is then output to the multiplexer 400. 

[0056] Furthermore, a pulse position is quantized with a predetermined number of bits, and an index representing the 
position is output to the multiplexer 400. 

[0057] For unvoiced periods, as indicated by Table 2. pulse positions are set at predetermined intervals, and shift 
amounts for shifting the positions of all pulses are determined in advance. In the following case, the pulse positions are 
shifted in units of samples, and fourth types of shift amounts (shift 0, shift 1, shift 2, and shift 3) can be used. In this 
case, the shift amounts are quantized with two bits and transmitted. 



BNSDOCID: <EP 09S7472A2_I_> 



8 



EPO 957 472 A2 

Table 2 
Pulse Position 



0, 4, 8, 12, 16. 20, 24, 28.. 



w 



[0058] The sound source quantization circuit 350 further receives polarity code vectors from the polarity codebook 
(sound source codebook) 352, and searches combinations of all shift amounts and all code vectors to select a combi- 
nation of a shift amount 5 Q) and a code vector gk which minimizes the distortion given by: 



75 



N-1 p M -i 



(15) 



[0059] An index representing the selected code vector and a code representing the selected shift amount are sent to 
the multiplexer 400. 

[0060] Note that a codebook for quantizing the amplitudes of a plurality of pulses can be learnt in advance by using 
20 speech signals and stored. A learning method for the codebook is disclosed, for example, in "An algorithm for vector 
quantization design", IEEE Trans. Commun., January 1980, pp.84-95) (reference 12). 

[0061 ] The information of amplitudes and positions of voiced and unvoiced periods are output to the gain quantization 
circuit 366. 

[0062] The gain quantization circuit 366 receives the amplitude and position information from the sound source quan- 
25 tization circuit 350. and receives the voiced/unvoiced discrimination information from the mode discrimination circuit 
370. 

[0063] The gain quantization circuit 366 reads out gain code vectors from a gain codebook 380 and selects one gain 
code vector that minimizes equation (16) below for the selected amplitude code vector or polarity code vector and the 
position. Assume that both the gain of the adaptive codebook and the sound source gain represented by a pulse are 
so vector quantized simultaneously. 

[0064] When the discrimination information indicates a voiced sound, a gain code vector is obtained to minimize D k 
given by: 



35 



N-l r M 

D k = L x w(") - K v (n " * h w(") - G - £ 9 ik h w (n - m f ) 

n=0L ' t=1 



(16) 



where pk and Gk are kth code vectors in a two-dimensional gain codebook stored in the gain codebook 380. An index 
40 representing the selected gain code vector is output to the multiplexer 400. 

[0065] If the discrimination information indicates an unvoiced sound, a gain code vector is searched out which mini- 
mizes D k given by: 



45 



N-lj- 

■Er 

O=0 L 



N-l r M "I 

x w (n) - p v(n - T) * h w (n) - G£ g' ik h w (n -m x - 6(j)) 



(17) 



[0066] An index representing the selected gain code vector is output to the multiplexer 400. 
so [0067] The weighting signal calculation circuit 360 receives the voiced/unvoiced discrimination information and the 
respective indices and reads out the corresponding code vectors according to the indices. For a voiced sound, the driv- 
ing sound source signal v(n) is calculated by: 



55 



M 



v(n) = p v(n •T) + Gy g 5(n - m :) 

i tk ' 



(18) 
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[0068] This driving sound source signal v(n) is output to the adaptive codebook circuit 500. 
[0069] For an unvoiced sound, the driving sound source signal v(n) is calculated by: 



v(n) = p v(n - T) + G.£ 9^(n - m, - 5(i)) 



(19) 



[0070] This driving sound source signal v(n) is output to the adaptive codebook circuit 500. 
io [0071] Subsequently, the response signals s w (n) are calculated in units of subframes by using the output parameters 
from the spectrum parameter calculation circuit 200 and spectrum parameter calculation circuit 210 using 



15 



10 10 10 

s w (n) = v(n) - £ ajV(n - i) + £ a^'pCn - i) + £ a'.y's^n 

i=l i=1 i=1 



(20) 



and are output to the response signal calculation circuit 240. 
20 Second Embodiment 

[0072] Fig. 2 is a block diagram showing the schematic arrangement of the second embodiment of the present inven- 
tion. 

[0073] Referring to Fig. 2, the second embodiment of the present invention differs from the above embodiment in the 
operation of a sound source quantization circuit 355. More specifically, when voiced/unvoiced discrimination information 
indicates an unvoiced sound, the positions that are generated in advance in accordance with a predetermined rule are 
used as pulse positions. 

[0074] For example, a random number generating circuit 600 is used to generate a predetermined number of (e.g., 
M1) pulse positions That is, the M1 values generated by the random number generating circuit 600 are used as pulse 
positions. The M1 positions generated in this manner are output to the sound source quantization circuit 355. 
[0075] If the discrimination information indicates a voiced sound, the sound source quantization circuit 355 operates 
in the same manner as the sound source quantization circuit 350 in Fig. 1. If the information indicates an unvoiced 
sound, the amplitudes or polarities of pulses are collectively quantized by using a sound source codebook 352 in cor- 
respondence with the positions output from the random number generating circuit 600. 



25 



30 



35 



Third Embodiment 



[0076] Fig. 3 is a block diagram showing the arrangement of the third embodiment of the present invention. 
[0077] Referring to Fig. 3, in the third embodiment of the present invention, when voiced/unvoiced discrimination infor- 
40 mation indicates an unvoiced sound, a sound source quantization circuit 356 calculates the distortions given by equa- 
tions (21) below in correspondence with all the combinations of all the code vectors in a sound source codebook 352 
and the shift amounts of pulse positions, selects a plurality of combinations in the order which minimizes the distortions 
given by: 



45 



'kj : 



N-1 

E 

n=0 



e w (n)-£g jk h w (n-m r 5(j)) 



(21) 



so and outputs them to a gain quantization circuit 366. 

[0078] The gain quantization circuit 366 quantizes gains for a plurality of sets of outputs from the sound source quan- 
tization circuit 356 by using a gain codebook 380, and selects a combination of a shift amount, sound source code vec- 
tor, and gain code vector which minimizes distortions given by: 



55 



N-1 j- M -, 

d kj = £ x w (n) - p' jV (n - T) * h w (n) - Q^g^tn - M , ■ 6Q) 2 (22) 

n=oL i=1 J 
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Fourth Embodiment 

[0079] Fig. 4 is a block diagram showing the arrangement of the fourth embodiment of the present invention. 
[0080] Referring to Fig. 4, in the fourth embodiment of the present invention, when voiced/unvoiced discrimination 
s information indicates an unvoiced sound, a sound source quantization circuit 357 collectively quantizes the amplitudes 
or polarities of pulses for the pulse positions generated by a random number generating circuit 600 by using a sound 
source codebook 352, and outputs all the code vectors or a plurality of code vector candidates to a gain quantization 
circuit 367. 

[0081 ] The gain quantization circuit 367 quantizes gains for the respective candidates output from the sound source 
w quantization circuit 357 by using a gain codebook 380, and outputs a combination of a code vector and gain code vector 
which minimizes distortion. 

Fifth Embodiment 

15 [0082] Fig. 5 is a block diagram showing the arrangement of the fifth embodiment of the present invention. 

[0083] Referring to Fig. 1 5, in the fifth embodiment of the present invention, a demultiplexer section 510 demultiplexes 
a code sequence input through an input terminal 500 into a spectrum parameter, an adaptive codebook delay, an adap- 
tive codebook vector, a sound source gain, an amplitude or polarity code vector as sound source information, and a 
code representing a pulse position, and outputs them. 

20 [0084] The demultiplexer section 51 0 decodes the adaptive codebook and sound source gains by using a gain code- 
book 380 and outputs them. 

[0085] An adaptive codebook circuit 520 decodes the delay and adaptive codebook vector gains and generates an 

adaptive codebook reconstruction signal by using a synthesis filter input signal in a past subframe. 

[0086] A mode discrimination circuit 530 compares the adaptive codebook gain decoded in the past subframe with a 

25 predetermined threshold to discriminate whether the current subframe is voiced or unvoiced, and outputs the 
voiced/unvoiced discrimination information to a sound source signal reconstructing circuit 540. 
[0087] The sound source signal reconstructing circuit 540 receives the voiced/unvoiced discrimination information. If 
the information indicates a voiced sound, the sound source signal reconstructing circuit 540 decodes the pulse posi- 
tions, and reads out code vectors from a sound source codebook 351 . The circuit 540 then assigns amplitudes or polar- 

30 ities to the vectors to generate a predetermined number of pulses per subframe, thereby reclaiming a sound source 
signal. 

[0088] When the voiced/unvoiced discrimination information indicates an unvoiced sound, the sound source signal 
reconstructing circuit 540 reconstructs pulses from predetermined pulse positions, shift amounts, and amplitude or 
polarity code vectors. 

35 [0089] A spectrum parameter decoding circuit 570 decodes a spectrum parameter and outputs the resultant data to 
a synthesis filter 560 

[0090] An adder 550 adds the adaptive codebook output signal and the output signal from the sound source signal 
reconstructing circuit 540 and outputs the resultant signal to the synthesis filter 560. 

[0091] The synthesis filter 560 receives the output from the adder 550, reproduces speech, and outputs it from a ter- 
40 minal 580. 

Claims 

1 . A speech coding apparatus including at least 

45 

a spectrum parameter calculation section (200.210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter, an adaptive codebook section (500) for obtaining a delay 
and a gain from a past quantized sound source signal by using an adaptive codebook, and obtaining a residue 
by predicting a speech signal, and 
so a sound source quantization section (350,366) for quantizing a sound source signal of the speech signal by 

using the spectrum parameter and outputting the sound source signal, comprising: 

a discrimination section (370) for discriminating a mode on the basis of a past quantized gain of an adaptive 
codebook; 

a sound source quantization section (350) which has a codebook (351,352) for representing a sound source 
55 signal by a combination of a plurality of non-zero pulses and collectively quantizing amplitudes or polarities of 

the pulses when an output from said discrimination section (370) indicates a predetermined mode, and 
searches combinations of code vectors stored in said codebook (351 ;352) and a plurality of shift amounts used 
to shift positions of the pulses so as to output a combination of a code vector and shift amount which minimizes 
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distortion relative to input speech; and 

a multiplexer section (400) for outputting a combination of an output from said spectrum parameter calculation 
section (200,21 0), an output from said adaptive codebook section (500), and an output from said sound source 
quantization section (350,366). 

5 

2. A speech coding apparatus including at least 

a spectrum parameter calculation section (200,210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter, 
10 an adaptive codebook section (500) for obtaining a delay and a gain from a past quantized sound source signal 

by using an adaptive codebook, and obtaining a residue by predicting a speech signal, and 
a sound source quantization section (355,366) for quantizing a sound source signal of the speech signal by 
using the spectrum parameter and outputting the sound source signal, comprising: 

a discrimination section (370) lor discriminating a mode on the basis of a past quantized gain of an adaptive 
15 codebook; 

a sound source quantization section (355) which has a codebook (351,352) for representing a sound source 
signal by a combination of a plurality of non-zero pulses and collectively quantizing amplitudes or polarities of 
the pulses when an output from said discrimination section (370) indicates a predetermined mode, and outputs 
a code vector that minimizes distortion relative to input speech by generating positions of the pulses according 
20 to a predetermined rule; and 

a multiplexer section (400) for outputting a combination of an output from said spectrum parameter calculation 
section (200,210), an output from said adaptive codebook section (500), and an output from said sound source 
quantization section (355,366). 

25 3. A speech coding apparatus including at least 

a spectrum parameter calculation section (200,210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter. 

an adaptive codebook section (500) for obtaining a delay and a gain from a past quantized sound source signal 
by using an adaptive codebook, and obtaining a residue by predicting a speech signal, and 
a sound source quantization section (356,366) for quantizing a sound source signal of the speech signal by 
using the spectrum parameter and outputting the sound source signal, comprising: 

a cfiscrimination section (370) for discriminating a mode on the basis of a past quantized gain of an adaptive 
codebook; 

a sound source quantization section (356,366) which has a codebook (351 ,352) for representing a sound 
source signal by a combination of a plurality of non-zero pulses and collectively quantizing amplitudes or polar- 
ities of the pulses when an output from said discrimination section (370) indicates a predetermined mode, and 
a gain codebook (380) for quantizing gains, and searches combinations of code vectors stored in said code- 
book (380), a plurality of shift amounts used to shift positions of the pulses, and gain code vectors stored in 
said gain code-book (380) so as to output a combination of a code vector, shift amount, and gain code vector 
which minimizes distortion relative to input speech; and 

a multiplexer section (400) for outputting a combination of an output from said spectrum parameter calculation 
section (200,210), an output from said adaptive codebook section (500), and an output from said sound source 
quantization section (356,366). 

A speech coding apparatus including at least 

a spectrum parameter calculation section (200,210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter, 
so an adaptive codebook section (500) for obtaining a delay and a gain from a past quantized sound source signal 

by using an adaptive codebook, and obtaining a residue by predicting a speech signal, and 
a sound source quantization section (357,367) for quantizing a sound source signal of the speech signal by 
using the spectrum parameter and outputting the sound source signal, comprising: 

a discrimination section (370) for discriminating a mode on the basis of a past quantized gain of an adaptive 
55 codebook; 

a sound source quantization section (357) which has a codebook (351,352) for representing a sound source 
signal by a combination of a plurality of non-zero pulses and collectively quantizing amplitudes or polarities of 
the pulses when an output from said discrimination section (370) indicates a predetermined mode, and a gain 
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codebook (380) for quantizing gains, and outputs a combination of a code vector and gain code vector which 
minimizes distortion relative to input speech by generating positions of the pulses according to a predeter- 
mined rule; and 

a multiplexer section (400) for outputting a combination of an output from said spectrum parameter calculation 
section (200,210), an output from said adaptive codebook section (500), and an output from said sound source 
quantization section (357,367). 

5. A speech decoding apparatus comprising: 

a demultiplexer section (510) for receiving and demultiplexing a spectrum parameter, a delay of an adaptive 
codebook, a quantized gain, and quantized sound source information; 

a mode discrimination section (530) for discriminating a mode by using a past quantized gain in said adaptive 
codebook; and 

a sound source signal reconstructing section (540) for reconstructing a sound source signal by generating non- 
zero pulses from the quantized sound source information when an output from said discrimination section 
(530) indicates a predetermined mode, 

wherein a speech signal is reproduced by passing the sound source signal through a synthesis filter section 
(560) constituted by spectrum parameters. 

6. A speech coding/decoding apparatus comprising: 

a speech coding apparatus including 

a spectrum parameter calculation section (200,210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter, 

an adaptive codebook section (500) for obtaining a delay and a gain from a past quantized sound source signal 
by using an adaptive codebook. and obtaining a residue by predicting a speech signal, 
a sound source quantization section (350,366) for quantizing a sound source signal of the speech signal by 
using the spectrum parameter and outputting the sound source signal, 

a discrimination section (370) for discriminating a mode on the basis of a past quantized gain of an adaptive 
codebook, and 

a codebook (351 ,352) for representing a sound source signal by a combination of a plurality of non-zero pulses 
and collectively quantizing amplitudes or polarities of the pulses when an output from said discrimination sec- 
tion (370) indicates a predetermined mode, 

said sound source quantization section (350) searching combinations of code vectors stored in said codebook 

(351 ,352) and a plurality of shift amounts used to shift positions of the pulses so as to output a combination of 

a code vector and shift amount which minimizes distortion relative to input speech, and further including 

a multiplexer section (400) for outputting a combination of an output from said spectrum parameter calculation 

section (200,210), an output from said adaptive codebook section (500), and an output from said sound source 

quantization section (350.366); and 

a speech decoding apparatus including at least 

a demultiplexer section (510) for receiving and demultiplexing a spectrum parameter, a delay of an adaptive 
codebook, a quantized gain, and quantized sound source information, 

a mode discrimination section (530) for discriminating a mode by using a past quantized gain in said adaptive 
codebook. 

a sound source signal reconstructing section (540) for reconstructing a sound source signal by generating non- 
zero pulses from the quantized sound source information when an output from said discrimination section 
(530) indicates a predetermined mode, and 

a synthesis filter section (560) which is constituted by spectrum parameters and reproduces a speech signal 
by filtering the sound source signal. 

7. A speech coding/decoding apparatus comprising: 

a speech coding apparatus including 

a spectrum parameter calculation section (200.210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter, 

an adaptive codebook section (500) for obtaining a delay and a gain from a past quantized sound source signal 

by using an adaptive codebook. and obtaining a residue by predicting a speech signal. 

a sound source quantization section (355,366) for quantizing a sound source signal of the speech signal by 
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using the spectrum parameter and outputting the sound source signal, 

a discrimination section (370) tor discriminating a mode on the basis of a past quantized gain of an adaptive 
codebook, and 

a codebook (351 ,352) for representing a sound source signal by a combination of a plurality of non-zero pulses 
and collectively quantizing amplitudes or polarities of the pulses when an output from said discrimination sec- 
tion (370) indicates a predetermined mode, 

said sound source quantization section (355) for outputting a combination of a code vector and shift amount 
which minimizes distortion relative to input speech by generating positions of the pulses according to a prede- 
termined rule, and further including 

a multiplexer section (400) for outputting a combination of an output from said spectrum parameter calculation 
section (200.21 0), an output from said adaptive codebook section (500), and an output from said sound source 
quantization section (355,366); and 
a speech decoding apparatus including at least 

a demultiplexer section (510) for receiving and demultiplexing a spectrum parameter, a delay of an adaptive 
codebook, a quantized gain, and quantized sound source information, 

a mode discrimination section (530) for discriminating a mode by using a past quantized gain in said adaptive 
codebook, 

a sound source signal reconstructing section (540) for reconstructing a sound source signal by generating 
positions of pulses according to a predetermined rule and generating amplitudes or polarities for the pulses 
from a code vector when an output from said discrimination section (530) indicates a predetermined mode, and 
a synthesis filter section (560) which is constituted by spectrum parameters and reproduces a speech signal 
by filtering the sound source signal. 

8. A speech coding apparatus comprising: 

a spectrum parameter calculation section (200,210) for receiving a speech signal, obtaining a spectrum 
parameter, and quantizing the spectrum parameter; 

means (500) for obtaining a delay and a gain from a past quantized sound source signal by using an adaptive 
codebook, and obtaining a residue by predicting a speech signal; and 

mode discrimination means (370) for receiving a past quantized adaptive codebook gain and performs mode 
discrimination associated with a voiced/unvoiced mode by comparing the gain with a predetermined threshold, 
and 

further comprising: 

sound source quantization means (350,355) for quantizing a sound source signal of the speech signal by using 
the spectrum parameter and outputting the signal, and searching combinations of code vectors stored in a 
code-book for collectively quantizing amplitudes or polarities of a plurality of pulses in a predetermined mode 
and a plurality of shift amounts used to temporally shifting a predetermined pulse position so as to select a 
combination of an index of a code vector and a shift amount which minimizes distortion relative to input speech; 
gain quantization means (366) for quantizing a gain by using a gain codebook (380); and 
multiplex means (400) for outputting a combination of outputs from said spectrum parameter calculation means 
(200.210). said adaptive codebook means (500), said sound source quantization means (350,355), and said 
gain quantization means (366). 

9. An apparatus according to claim 8, wherein said sound source quantization means (350,355) uses a position gen- 
erated according to a predetermined rule as a pulse position when mode discrimination indicates a predetermined 
mode. 

10. An apparatus according to claim 9, wherein when mode discrimination indicates a predetermined mode, a prede- 
termined number of pulse positions are generated by random number generating means (600) and output to said 
sound source quantization means (350,355). 

11. An apparatus according to claim 8, wherein when mode discrimination indicates a predetermined mode, said 
sound source quantization means (350,355) selects a plurality of combinations from combinations of all code vec- 
tors in said codebook (351 ,352) and shift amounts for pulse positions in an order in which a predetermined distor- 
tion amount is minimized, and outputs the combinations to said gain quantization means (366), and 

said gain quantization means (366) quantizes a plurality of sets of outputs from said sound source quantization 
means (350,355) by using said gain codebook (380), and selects a combination of a shift amount, sound 
source code vector, and gain code vector which minimizes the predetermined distortion amount. 
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(54) Speech coding apparatus and speech decoding apparatus 



(57) A speech coding apparatus includes a spec- 
trum parameter calculation section (200,210), an adap- 
tive codebook section (500). a sound source 
quantization section (350;355;356;357,366), a discrimi- 
nation section (370), and a multiplexer section (400). 
The spectrum parameter calculation section (200,210) 
receives a speech signal and quantizes a spectrum 
parameter. The adaptive codebook section (500) 
obtains a delay and a gain from a past quantized sound 
source signal using an adaptive codebook, and obtains 
a residue by predicting a speech signal. The sound 
source quantization section (350;355;356;357,366) 
quantizes a sound source signal using the spectrum 
parameter. The discrimination section (370) discrimi- 
nates the mode. The sound source quantization section 



(350;355;356;357,366) has a codebook (351 ,352) for 
representing a sound source signal by a combination of 
non-zero pulses and collectively quantizing amplitudes 
or polarities of the pulses in a predetermined mode, and 
searches combinations of code vectors and shift 
amounts used to shift the positions of the pulses to out- 
put a combination of a code vector and shift amount 
which minimizes distortion relative to input speech. The 
multiplexer section (400) outputs a combination of out- 
puts from the spectrum parameter calculation section 
(200,210), the adaptive codebook section (500), and 
the sound source quantization section 
(350;355;356;357,366). 
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